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Abstract —We design a critically-sampled compact-support 
biorthogonal transform for graph signals, via graph filterhanks. 
Instead of partitioning the nodes in two sets so as to remove 
one every two nodes in the interbank downsampling operations, 
the design is based on a partition of the graph in connected 
subgraphs. Coarsening is achieved by defining one “supernode” 
for each subgraph and the edges for this coarsened graph 
derives from the connectivity between the subgraphs. Unlike the 
“one every two nodes” downsampling on bipartite graphs, this 
coarsening operation does not have an exact formulation in the 
graph Fourier domain. Instead, we rely on the local Fourier 
bases of each subgraph to define filtering operations. We apply 
successfully this method to decompose graph signals, and show 
promising performance on compression and denoising. 

Index Terms —Graph signal processing, filterbanks, Laplacian 
pyramid, community detection, multiresolution, wavelet. 

1. Introduction 

Graphs are a modeling tool suitable to many applications 
involving networks, may they be social, neuronal, or driven 
from computer science, molecular biology [2]... Data on these 
graphs may be defined as a scalar (or vector) on each of 
its nodes, forming a so-called graph signal [3]. In a sense, 
a graph signal is the extension of the 1-D discrete classical 
signal (where the signal is defined on the circular graph, each 
node having exactly two neighbors) to any arbitrary discrete 
topology where each node may have an arbitrary number of 
neighbors. Temperature measured by a sensor network, age of 
the individuals in a social network, Internet traffic in a router 
network, etc. are all examples of such graph signals. 

Adapting classical signal processing tools to signals defined 
on graphs has raised significant interests in the last few 
years [3], [4]. For instance, the graph Fourier transform, the 
fundamental building block of signal processing, has several 
possible definitions, either based on the diagonalisation of 
one of the Laplacian matrices [5], or based on Jordan’s 
decomposition of the adjacency matrix [6], [7]. Building upon 
this graph Fourier transform, authors have defined different 
sampling and interpolation procedures [8]-[12], windowed 
Fourier transform [13], [14], graph empirical mode decom¬ 
position [15], different wavelet transforms, including spec¬ 
tral graph wavelets [5], [16], [17], diffusion wavelets [18], 
and wavelets defined via filterbanks [19]-[22]. Among the 
applications of graph signal processing, one may cite works 
on fMRI data [23], on multiscale community detection [24], 
image compression [25], etc. In fact, graph signal processing 
tools are general enough to deal with many types of irregular 
data [4]. 

Authors are with the Univ Lyon, Ens de Lyon, Univ Claude 
Bernard, CNRS, Laboratoire de Physique, F-69342 Lyon, France. Email: 
firstname.lastname@ens-lyon.fr. Work supported by the ANR-14-CE27-0001 
GRAPHSIP grant. A preliminary approach of this work was presented at 
Asilomar Conference on Signals, Systems, and Computers 2015 [1]. 


Graph filterbanks using downsampling have been initially 
defined for bipartite graphs [19] because: i) Bipartite graphs, 
by definition, contain two sets of nodes that are natural 
candidates for the sampling operations; ii) Downsampling 
followed by upsampling (which forces to zero the signal on 
one of the two sets of nodes) can be exactly written as a 
filter in the graph Fourier space. This enables to write exact 
anti-aliasing equations for the low-pass and high-pass filters to 
cancel the spectral folding phenomenon due to sampling [19]. 
However, for arbitrary graphs, one needs to decompose the 
graph in a (non-unique) sum of bipartite graphs [20], [21], 
[26], and analyze each of them separately. Another solution 
for arbitrary graph is based on downsampling according to the 
polarity of the graph Fourier mode of highest frequency [27]. 

We propose a significantly different way of defining filter- 
banks. Instead of trying to find an exact equivalent of both the 
decimation operator, hereafter (J^), and a filtering operator C, 
we directly define a decimated filtering operator L = (>1^)C; 
following here the notations of [28], where L is not to be 
confused with the Laplacian operator of the graph, noted C. 
Consider the 1-D straight-line graph where each node has two 
neighbors, and a partition of this graph in subgraphs of pairs 
of adjacent nodes. The classical Haar low-pass (resp. high- 
pass) channel samples one node per subgraph and defines on 
it the local average (resp. difference) of the signal. By analogy, 
we consider a partition of the graph in connected subgraphs, 
not necessarily of same size. Creating one “supernode” per 
subgraph, the low-pass channel (resp. high-pass channels) 
defines on it the local, i.e., over the subgraph, average (resp. 
differences) of the signal. The coarsened graphs on which 
those downsampled signals are defined, are then derived from 
the connectivity between the subgraphs: two supemodes are 
linked if there are edges between the associated subgraphs. 

With this approach, we design a critically-sampled, 
compact-support biorthogonal filterbank that is valid for any 
partition in connected subgraphs. Depending on the application 
at hand, one has the choice on how to detect such partitions. 
For compression and denoising, an adequate way is to use a 
partition in communities, i.e. groups of nodes more connected 
with themselves than with the rest of the network [29]. This 
community structure is indeed linked to the low frequencies 
of graph signals [24], [30]. For hierarchical clustering trees, 
multiresolution bases on graphs have been explored in [31]- 
[34]. As a difference here, we not only take into account a 
hierarchical clustering in groups, but also the local intra-cluster 
topology in each group when defining the analysis atoms. 

Section II recalls the definition of the graph Fourier trans¬ 
form we use. In Section III, after detailing the difficulties to 
extend classical filterbanks to graph signals, we discuss the 
state-of-the-art of graph filterbanks. The main contribution is 
in Section IV, first discussed as an analogy to the Haar fil- 
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terbank, before presenting fully the proposed graph filterbank 
design. Section V proposes how to obtain a relevant partition 
in connected subgraph. Section VI shows applications, in 
compression and denoising. We conclude in Section VII. 



a) 


II. The Graph Fourier Transform 

Let Q = (V,f,A) be a undirected weighted graph with 
V the set of nodes, £ the set of edges, and A the weighted 
adjacency matrix such that A^j = Aji > 0 is the weight of 
the edge between nodes i and j. Let N be the total number of 
nodes. Let us define the graph’s Laplacian matrix C = T> — A 
where D is a diagonal matrix with = '^j^i ^ij 

the strength of node i. C real symmetric, therefore diago- 
nalizable: its spectrum is composed of ^ its set of 

eigenvalues that we sort: 0 = Ai < A 2 < A 3 < ••• < 
A at; and of Q the matrix of its normalized eigenvectors: 
Q = (^i|^ 2 | • • • Considering only connected graphs, the 
multiplicity of eigenvalue Ai = 0 is 1 [35]. By analogy to 
the continuous Laplacian operator whose eigenfunctions are 
the continuous Fourier modes and eigenvalues their squared 
frequencies, Q is considered as the matrix of the graph’s 
Fourier modes, and at associated “fre¬ 

quencies” [3]. For instance, the graph Fourier transform x of 
a signal x defined on the nodes of the graph reads: x = x. 


III. State of the art 


A. Classical 1-D filterbank and the Haar filterbanks 

In classical setting, the decimation (>1^ 2) operator by 2 is 
paramount. It keeps one every two nodes and follows what we 
call the “one every two nodes paradigm”, as seen on Fig. la). 
The classical design of a filterbank is to find a set of operators, 
e.g. a low-pass filter C and a high-pass filter D, that combine 
well with decimation such that perfect recovery is possible 
from the decimated low- and high-pass filtered signals [28]. 

The usual Haar filterbank will be used as a leading example 
to expose the main issues encountered when attempting to 
extend filterbanks to graph signals. Consider the 1-D discrete 
signal X of size N. Let us recall that, at the first level of the 
classical Haar filterbank, x is decomposed into [28]: 

- its approximation Xi of size N/2\ Xi = 2)Cx, where 

C is the sliding average operator, here in matrix form: 


C = 


1 

71 


110 0 
0 110 
0 0 11 


( 1 ) 


- its detail X 2 of size N/2: X 2 = (>1^ 2)Dx^ where D is the 
sliding difference operator: 



-110 0 
0-110 
0 0-11 


( 2 ) 


This Haar filterbank is orthogonal and critically sampled. 
Our objective is to generalize it to signals on arbitrary graphs. 



Fig. 1. State-of-the-art graph downsampling procedure following the “one 
every two nodes” paradigm (one set in black, the other in gray): a) (resp. b) the 
classical ID (resp. 2D) downsampling; c) bipartite graph downsampling [19]; 
d) polarity downsampling from the highest frequency graph Fourier mode [27]. 


B. Adapting filterbanks to graph signals 

For graph signals, a key difficulty is the design of a suitable 
decimation operator, and it comes in two separate problems: 

i) how to choose the nodes to keep? 

ii) how to wire together the nodes that are kept, so as to 
create the downsampled graph? 

On a straight line or a regular grid, issue i) is solved by the 
one every two nodes paradigm and issue ii) does not exist: 
the structure after downsampling is exactly the same as the 
original (straight line or 2D grid); see Figs. 1 a) and b). 

To tackle these issues, the following works propose a way 
to adapt the one every two nodes paradigm to arbitrary graphs. 

1) A design for signals defined on bipartite graphs: Narang 
and Ortega [19], [26] consider first signals defined on bipartite 
graphs (i.e. two-colourable graphs). In this particular case, one 
may still downsample the graph by naturally keeping one every 
two nodes, as shown in Fig. Ic). For non-bipartite graphs, 
they develop a preprocessing step of the structure where the 
graph is decomposed in a sum of bipartite subgraphs, on 
which the filterbank is successively applied. Other methods 
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to create bipartite graphs have been proposed, by oversam¬ 
pling [20], or with maximum spanning tree [21]. For issue 
ii), the downsampled structure has edges between two nodes 
if they have at least a common neighbor in the initial graph. 
As seen in Fig. Ic), a downsampled bipartite graph is not 
necessarily bipartite and the preprocessing of the structure is 
mandatory to iterate the filterbank design. Still, an interesting 
property of this design is the specific behavior of bipartite 
graphs Laplacian’s eigenvalues, that enables the authors to 
write specific anti-aliasing equations for the design of the 
filters C and D (see Eq. (13) of [19]). 

2) A downsampling based on the Laplacian ’s last eigenvec¬ 
tor: Shuman et al. [27] focus on the eigenvector associated 
to the Laplacian’s largest eigenvalue. They create two sets 
of nodes depending on this eigenvector’s sign, as illustrated 
in Fig. Id). According to the Fourier interpretation of the 
Laplacian’s eigenbasis, the last eigenvector corresponds to the 
“highest frequency” of a graph signal. This idea, inspired 
by graph coloring studies [36], generalizes the fact that, 
for structured grids and bipartite graphs, the sign of this 
eigenvector does alternate every two nodes. To tackle issue 
ii), the authors in [27] rely on the Kron reduction [37] of the 
initial graph to obtain new graphs, and post-process them to 
remove links from otherwise very dense downsampled graphs, 
implying some degree of arbitrary choices. 

In summary, there are many choices (and some stochas- 
ticity) in the pre- or post-processing steps to obtain suitable 
decimated graphs on which the filterbank can be cascaded. 

IV. Filterbanks on connected subgraphs 

The idea we explore lets go of the “one every two nodes” 
paradigm, and concentrates on graph coarsening: given a 
partition in connected subgraphs of the initial graph, the 
approximation and detail(s) will be obtained on each “supem- 
ode” that represent each connected subgraph. Hence we will 
not attempt to define separately analogies of downsampling 
2 ) and filtering C and D. Instead, we directly define 
analogies to graph signals of the decimated sliding average 
operator L = 2)C and of the decimated sliding difference 

operator B = 2)D. They read for the Haar filterbanks: 

■ 1 1 0 0 0 0 ... ■ 

1 0 0 1 1 0 0 ... 

L = ^ 0 0 0 0 1 1 ... (3) 

and: 

-1 1 0 0 0 0 

0 0-1100 
0 0 0 0 -1 1 

(4) 

In Section IV-A, we first take a close look at the effect of these 
two Haar operators L and B on the input signal x, to give 
insight in the fundamental analogy that is further formalized in 
Sections IV-B to IV-D. In Section IV-E, we detail the analysis 
atoms created by the proposed filterbanks. 





A. Introducing the design by analogy to the Haar filterbank 

Let us rephrase the Haar filterbank from the proposed new 
point of view of operators on connected subgraphs. 

1) Replace decimation by partition: Consider the 1-D clas¬ 
sical signal X of even size N defined on the straight line graph 
Q, of size N, where each node has two neighbors. We consider 
the partition c of this graph in AT = N/ 2 connected subgraphs 

connecting neighbors two-by-two: we call it 
the Haar partition, and it reads (when coded as a vector): 

c= (1,1,2,2,3,3,---(5) 

where c{i) is the label of node i’s subgraph. 

2) Interpret operators L and B in terms of local Fourier 

modes: Consider subgraph and x^^^ the restriction of x 
to this subgraph. Define local adjacency matrix : 

VA:g{1,...,7V/2} J J . (6) 

Its Laplacian matrix is diagonalisable with two local Fourier 
modes: ^ (1, 1) of associated eigenvalue = 0 

and ^ 2 ^^^ = ^ (—1, 1) of associated eigenvalue = 2. 

The actual effect of the operation xi = Lx in Haar 
filterbank is to assign to each subgraph the first local 
Fourier component oi x^^^ : 

\/k G {1,..., A^/2} Xi{k) = x^^\ (7) 

Similarly, the actual effect of the operation X 2 = Bx may 
be rewritten as : 

\fk G {1,..., A^/2} X2{k) = q2^^^x^^\ (8) 

In other words [xi{k) X 2 {k)]'^ is the local (reduced to 
Fourier transform of x^^\ 

3) Analogy for graph signals: Consider a graph Q and 
a partition c of this graph in K connected subgraphs 

^ Consider one of these subgraphs of size 

N^. Consider x^^^ the local graph Fourier transform of x^^\ 
the graph signal reduced to To this end, we diagonalize 
0^^^’s local Laplacian matrix to find its eigenvectors 
(a.k.a. local Fourier modes) sorted w.r.t. their eigenvalues 
and compute the successive inner products. We propose the 
following fundamental analogy: the first coefficient of x^^^ 
will contribute to the approximation Xi of the signal, and the 
following coefficients to its successive details X 2 ^. • • ^xn^- 

4) Graph support of the decimated components: For 1-D 
Haar filterbanks, the two components are defined on straight- 
line graphs of size N/2. For arbitrary graphs, all subgraphs 
have not necessarily the same size. This implies that only 
subgraphs of size at least I contribute to xi. For instance, a 
three-node subgraph will have three eigenvectors for its 
local Laplacian and its first (resp. second, third) eigenvector 
will contribute to Xi (resp. X 2 , xf). 

The proposed analogy naturally defines the graphs on which 
are defined the downsampled signals xi. Let us introduce a 
supernode k standing for each subgraph For / = 1, the 
approximation signal Xi lies naturally on a graph of adjacency 
matrix Ai where Ai{k,k') is the sum of the weights of the 
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edges connecting subgraph k to subgraph k' in the original 
graph. Then, for the subsequent graph of adjacency matrix 
Ai on which is defined the detail signal Xi, only supernodes 
standing for subgraphs of size at least I are needed and Ai is 
defined the same way by summing the edges between involved 
subgraphs. We formalize this analogy in the following. 

B. Formalization of the operators necessary to the design 

To help the assimilation of the definitions introduced here, 
the reader may in parallel look at Appendix A, where a trivial 
concrete example is fully detailed. 

1) Subgraph sampling operators: Consider an arbitrary 

graph Q and an arbitrary partition c of this graph in K 
connected subgraphs | . Write Nk the number 

of nodes in subgraph and T^^^ C V the list of nodes 
in For each subgraph let us define the sampling 

operator G such that: 

c«(i,j) = iifrW(i) = i, 

= 0 if not. 

Applying to a graph signal x, one obtains the 

graph signal reduced to subgraph i.e. x^^\ Conversely, 
applying expands a signal defined on to a graph 
signal on Q with zero-padding. 

The adjacency matrix of noted G satisfies: 

Mk e {1, K) (10) 

As a consequence, the intra-subgraph adjacency matrix Ai^t, 
that is, the adjacency matrix of the graph that contains only 
the intra-subgraph edges and none of the inter-subgraph edges, 
may be written as: 

Aint = ( 11 ) 

k=l 

The complement to obtain the full adjacency matrix A is 
called the inter-subgraph adjacency matrix and is defined as 
Aext = A—Aint', it keeps only the links connecting subgraphs 
together. 

2) Subgraph Laplacian operators: On each let us 

define ^ocslI Laplacian matrix, computed from 

It is diagonalisable: 

\fk e {1, K} ( 12 ) 

with the diagonal matrix of sorted eigenvalues (A^^^ is 
the smallest): 

A«=diag(A^''\Af\...,A^;), (13) 

and the basis of local Fourier modes: 

= (14) 

and ^ with: 

p« = (15) 


(k) 

We normalize the with the norm: 

Vi G [i,Nk] llgfllp = |gP(j)r j = 1. (16) 

Note that in the specific case where p = 2 for this normaliza¬ 
tion, we end up with We will see that in this 

case, the filterbank simply codes for an orthogonal transform. 
We discuss the choice of p (usually 1 or 2) in Section IV-E. 
For each of size defined on the local subgraph 

_(k) 

let ql ^ be its zero-padded extension to the whole global graph: 
VfcG{l,K} ViG{l,fVfe} gf)=C'Wgf). (17) 

_(k) (k) 

Similarly p • ^ stands for the zero-padded extension of p • \ 

3) Analysis, synthesis and group operators: Considering 
the connected subgraph partition c, define Ni the maximum 
number of nodes in any subgraph: 

7Vi= max A/'/c. (18) 

k 

For any I G {1,..., TVi}, we note Xi the list of subgraph labels 
containing at least I nodes: 

V/ G {1,..., Ni} Xi = {ke {1, K} s.t Nk>l}. (19) 

For instance, as all subgraphs contain at least 1 node, Xi 
contains all the K subgraph labels. Also, necessarily: \Xi\ > 
\X2\ > ' " > \Xy^^ I (where |.| denotes the cardinality). By 
construction, we have \Xi \ = N. 

The family of analysis operators {©/ G contains 

Ni operators that generalize L and B of the Haar filterbank: 

V^G{l,...,fVi} ©;= 

This means that operator 0/ groups together all local Fourier 
modes associated to the l-th eigenvalue of all subgraphs 
containing at least I nodes. 

The family of Ni synthesis operators {11/ G 
reads: 

V(G ,iVi} n,= 

The family of group operators {Fli G also 

contains Ni operators defined as: 

v^G fVi} = 

= 0 if not. 

This means that fli groups together indicator functions of 
subgraphs containing at least I nodes. 

4) On the operators’ uniqueness: Operators as we have 
defined them are not unique if no further rules are enforced. 

f/c) f/c) 

In fact, for each eigenvector q- \ its opposite — q- ^ is also an 
eigenvector. Moreover, in the case of eigenvalue multiplicity, 
associated eigenvectors are not unique. To enforce uniqueness, 
any set of deterministic rules to extract eigenvectors will work. 
To solve the orientation issue, one may for instance decide to 
set the first non-zero coefficient of all vectors to be positive. 
For eigenvalues with multiplicity, we discuss a possible set of 
rules in Appendix B that guarantees uniqueness. 
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Analysis bl<^k 



Approximation: 
I (1) 

I (xi, Ai) I 

_petmlsj^ _ 

I (X 2 , A 2 )'^’ I 

I I 

I I 

I (X 3 , Asf’ ' 

I I 

I I 

1 (■■■) I 

I I 

( 1 ) 

I (XNi, Ani) I 


SyntA^sfs b/ocfc {IJ 



Fig. 2. Schematic representation of the analysis and synthesis blocks. 


C. The filterbank design 

1) Analysis block: Given a signal x defined on the graph 
whose adjacency matrix is A, one first needs to find a partition 
c in connected subgraphs. The maximum number of nodes in 
any subgraph of c is Ni (see Eq. (18)) and it determines the 
number of channels through which x will be analyzed. Then, 
from c, one constructs all operators and as described 
in Section IV-B. Finally, the signal x defined on the graph is 
decomposed, through the Ni channels, in Ni signals: 

VI e {l,...,7Vi} xi = &Jx, (23) 

each of them defined on a graph whose adjacency matrix reads: 

Ai = (24) 

By this formula, the convention is that there is no self-loop, 
i.e., for k = k', Ai{k, k) = 0. Also, adjacency matrices Ai for 
/ > 2 are only needed if one decides to cascade the filterbank 
on detail channels; here, the cascade will only be done on Ai 
(see Fig.3 and Section IV-D). 

2) A remark on the storage of structural information: An 
important question is whether the total amount of stored infor¬ 
mation pre- and post-analysis equal or not? In terms of signal 
information only (i.e., discarding the structural information), 
the total amount of stored information is equal on both sides 
of the analysis block as each of the downsampled signals xi 
is of size \Ii \ and \Ii \ = N. 

On the other hand, in terms of structural information (i.e. 
the information of the adjacency matrices), one needs to 
keep both the structural information pre- and post-analysis. 
Indeed, the post-analysis structural information is not enough 
to reconstruct the original graph (unlike the signal x who 
can be perfectly reconstructed from its approximation and 
details, as we will see in Section IV-C3). The amount of 
stored structural information therefore increases after analysis 
and this is -at least for now- an irreducible storage price to 
pay. This is a common downfall of all graph filterbanks yet 
proposed, e.g. [19]-[21], [26], [27]. Finding ways to critically 
sample both the structure and the signal defined on it is part 
of our ongoing research and is not in the scope of this article. 

In the following, all graph structures are stored in the form 
A = Aint + Aea^t, as this does not increase the amount of 
information (the number of links) but still subtly encodes the 
connected subgraph structure: indeed the partition c can be 
exactly recovered by detecting the connected components of 


Aint- In Narang et al. [19], [26], authors also need to keep an 
information equivalent to c: the bipartite graph decomposition 
of A and, for each bipartite graph, the information of the two 
sets of nodes. It is also the case in Shuman et al. [27]’s work, 
where authors need to keep the downsampling vector m. 

3) Synthesis block: One starts the synthesis with the list 

of signals TVi} structure information A = 

Aint + ^ext- We show here that x may be exactly recovered 
from that information. First of all, one extracts c from Aint, 
which in turn enables to compute all synthesis operators {Hi} 
following Sections IV-B2 and IV-B3. Moreover: 

Lemma 1. [Perfect reconstruction] x is perfectly recon¬ 
structed from {xi}i^^^ by applying successively each 

synthesis operator to its corresponding channel: 

Ni 

= (25) 

1=1 

Proof: Combining Eqs. (23) and (25), one has: 

Ni Ni Ni \Xi\ 

y^UiXi = {y2ni&J)x = 

1 = 1 1 = 1 1 = 1 j = l 

(26) 

is a matrix of size N x N, with non zero 
coefficients only for indices in subgraph Ti{j). In this non¬ 
zero block, it equals . One finally obtains: 

0 
0 

piK)QiK)T 

(27) 

the Identity, = , which ends the proof. ■ 

We show in Fig. 2 a schematic representation of the analysis 
and synthesis blocks of the proposed graph filterbanks. 

4) Critical sampling and biorthogonality: Starting with 

data X of size N, the analysis block provides vectors xi of 
size |//| V/ G ?Vi}. Storing all the Xi accounts for 

^i\di \ = N values: the filterbank is critically sampled. 

Furthermore, this filterbank is biorthogonal. Indeed, the 
analysis filter combining the Ni channels may be written 
as [©1 ©2 ... • Moreover, the overall synthe¬ 
sis filter reads [lIi 112 ••• n^Vi]- filterbank is 

biorthogonal as we have: 

[ill n2 ... n^J [©1 ©2 ... ^TvJ 

as shown by Lemma 1. When we choose p = 2 for the 
normalization of Eq. (16), this filterbank is orthogonal. 


Ni 

= 

1=1 


P(1)q(1)T q 

0 P(2)q(2)T 


D. The analysis cascade 

Filterbanks are easily organised in cascade, each level of 
which contains the analysis and synthesis operators defined in 
Section IV-C. Given an original graph of adjacency matrix A, 
a signal x defined on it, and a partition c of the graph, one 
may obtain the Ni analysis operators {©[^^} where the index 
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(x. A) 



Fig. 3. The first three levels of the analysis cascade. 


in parenthesis stands for the level of the analysis cascade. The 
cascade’s first level comprises of: 

v; e [l,Ni] and 

For each of these channels, one may iterate the same analy¬ 
sis scheme, thereby obtaining successive approximations and 
details of the original signal at different scales of analysis. 
Classically, and we will follow this approach in the following, 
one iterates the analysis scheme only on the approximation 
signal at each level of the cascade, as shown on Fig. 3. 
Considering the approximation signal at level (j) defined 
on the approximation graph coded by and a partition 

of this graph, one may obtain the A^j+i analysis operators 
{©p+i)}, define the next scale of analysis. Similarly, we 
note the associated synthesis operators. 

Note that the number of channels is not necessarily constant 
down the cascade. It is in fact adaptative: the number of 
channels Nj of the analysis block {j ) depends on the maximal 
number of nodes contained in the subgraphs given by 

1) A signal defined on a simple toy graph: Fig. 4 shows 
the analysis cascade on a toy signal defined on a simple graph 
of size 14. Let us take a close look at subgraph number 5 
of the original graph: it contains 2 nodes and the signal on 
each of its node is of same absolute value but of opposite 
signs. As expected, ccpp5), the approximation signal on the 
corresponding supernode at level (1) is null (average of the 2 
original values); and (5), the first detail signal is large: it is 
the difference between the 2 original values. Moreover, as this 
subgraph contains only 2 nodes, its local Laplacian does not 
have a third eigenvector: this subgraph does not participate to 
the second and third detail signals and its associated supemode 
does not appear in (ccs, nor {x/^^ 

The analysis cascade decomposes the original signal in 3 
detail signals (of sizes 5, 3 and 1) at level (1), 2 detail 
signals (of sizes 2 and 1) at level (2), 1 detail signal and one 
approximation signal (both of size 1) at level (3). From these 
7 downsampled signals of total size 14, one may perfectly 
reconstruct the original signal, using the synthesis operators 
defined in Section IV-C3. 

E. Atoms of analysis and the choice of normalization 

To study the effect of the filterbank, one may look at the 
dictionary of analysis atoms, and of recovery atoms. With 
additional assumptions, it could be wavelets. However, we will 
avoid using the term wavelet and prefer the more general term 
of “atom”, as they do not necessarily have wavelet’s properties: 
they are for instance not related to translation on the graph. 


(x. A) 



Partition in 


connected ':i 

subgraphs of Ai ... 

[N; 

>- = 3 ] 

(xi, Ai 

(X2, Azf! (X3. Ab)"’ 

Level (2) ^ ^ 

Partition in 

1 

connected ( 2 ) 1;' - 


subgraphs of Ai 


[N; 

3 = 2 ] 

(xi, Ai 

f (X2, A2)"’ 


Level (3) 


Fig. 4. An analysis cascade of a signal x defined on a toy graph A (top 
figure). On each node, the red vertical bar is proportional to the value of this 
node’s signal. Each edge’s width is proportional to its weight. Under the top 
figure is represented a partition in connected subgraph (see Section V): for 
clarity’s sake, one color is assigned to each subgraph. The largest subgraph 
contains Ni = A nodes: the first level of the cascade therefore contains Ni = 
4 channels. At level (1), we represent the approximation (cci, and 

the three details {{xi, }i —2 3 4 ‘ coarsened graphs, the 

color of each supernode corresponds to the color of A’s subgraph it represents. 
We then iterate the analysis on the successive approximation signals, until 
level (3), where the approximation signal (cci, Ai)(^) is reduced to a single 
node. The underlying orange tree is a guide to the eye down the analysis 
cascade. From the 6 detail and 1 approximation signals of each of its leaves, 
one may perfectly recover the original signal x. 


To each output of the analysis cascade (approximations 
and details at all levels) is associated an analysis atom. 
Approximation analysis atoms (“scaling function-like”) are 
associated to the first channel of each level: at level (j), 
they are the columns of upsampled back to the original 
graph’s size: 

$(i) ^ ©(1) X ... X X (28) 

Detail analysis atoms (“wavelet-like”) are obtained from all 
but the first channel at each level. More precisely at level (j), 
the detail analysis atoms associated to channel / 7^ 1 are the 
columns of 0^% upsampled back to the original graph’s size: 

= ©p) X ... X ©p“^^ X (29) 

The choice of the Lp norm in Eq. (16) is now dictated by 
the desired properties of the atoms. A first possible choice is 
p = 1, i.e. normalization in Li, as it is the only normalization 
that ensures that the detail analysis atoms have zero 

mean - a desirable feature to have atoms as close as possible 
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Fig. 5. The 14 analysis atoms of the proposed hlterbank for the graph represented at the top of Fig. 4. From left to right, and top to bottom, are listed the 13 
detail atoms, from small to large scales. The bottom right atom is the approximation atom (hrst channel at level (3) of the analysis cascade). A node’s color 
represents the atom’s value on that node, as indicated by the colorbar. A black node corresponds to a strictly null value: indeed, atoms are compact-support. 


level (j-1) level (j) 

xT’onAT’: xi onAi: 



Fig. 6. Approximation at level (j) of an analysis cascade of 

the signal-structure couple given two different partitions 

in connected subgraphs (represented by the dotted lines). The signal is 
represented by colors on the nodes. Subgraph k in the original graph is 
represented by supernode k in the coarsened graph. As seen in Section IV, 
the signal on supernode k is the signal’s average on the original subgraph k. 
Note the strong impact of the partition on the approximation signal. 

to a wavelet interpretation. Another possibility would be to 
normalize in L 2 (as for the Haar hlterbank). In this case, detail 
atoms do not have zero mean in general, however the energy of 
the modes is constant. In Section VI, the normalization will be 
application-dependent. The default normalization is with Li. 

In Fig. 5, we show the 14 analysis atoms corresponding to 
the analysis cascade of Fig. 4: one approximation atom (from 
the approximation channel at the last level of the cascade); 
and 13 detail atoms that represent the other channels. 

A property of the atoms is that their support is always 
compact: each is dehned and non-zero only on one subgraph. 
On the other hand, in the global Fourier domain, the atoms 
are exactly localized only if the decomposition in subgraphs 
corresponds exactly to different connected components of the 
whole graph (in this case, the global Fourier matrix is the 
concatenation of all local Fourier matrices). If not, the further 
away is the graph from this disconnected model, the less 
localized are the atoms in the global Fourier domain. 

V. Detecting a partition of connected subgraphs 

The proposed hlterbank explicitly integrates the graph struc¬ 
ture in connected subgraphs. A central question arises: how 
does one choose a particular partition c of the graph in con¬ 
nected subgraphs? The partition choice has a strong inhuence 
on what the hlterbank achieves, as shown in Fig. 6 where 
we compare the effect of downsampling for two different 
partitions on a toy graph. The practitioner has the choice 
among a wide variety of options to hnd such a partition: he or 
she could follow graph partitioning techniques of [38] or [39], 


or use graph nodal domains [40] - either very high frequency 
ones as in [27] or others — or any other solution... While the 
proposed hlterbank is well-dehned for any of these partitions, 
the hnal decision regarding the partitioning algorithm will 
depend on what the user wants the hlterbanks to achieve. 

In the following, we show applications for compression and 
denoising. We seek to typically transform the original signal 
into a sparser one after analysis. For that, we look for partitions 
that separate the graph into groups of nodes more connected to 
themselves than with the rest of the graph: they are known as 
communities. Indeed, as in image or video compression, we 
suppose that low-frequencies contain the useful information 
of the signal. Approximating a community of nodes, each one 
with its signal value, by a supernode on which is the average 
over the community is a way to keep such low-frequencies. 

A. Community detection procedure 

Literature is abundant on community detection (see the sur¬ 
vey [29]). To detect non-overlapping communities, we use the 
greedy Louvain method [41]. It maximizes (approximatively) 
over all the possible partitions c, the so-called modularity 
(see [29]), a well-known objective function that measures the 
quality of a partition in communities c, defined as: 

ij ^ '' 

where di = Aij and 2m = di. The Louvain method 
iteratively repeats two main phases, starting from an initial 
situation where each node is in its own community: 1) Select a 
node and group it with its adjacent node that causes the largest 
increase of modularity; do this sequentially with all other 
nodes, until no individual move can improve the modularity; 
2) Aggregate each community in a “supemode” and build 
a new adjacency matrix of this “supernode” graph. Phase 1 
is then applied to this new graph, and so on and so forth. 
The algorithm stops when phase 1 is not able to increase the 
modularity anymore. We modify this algorithm and have two 
different implementations: 

The SC (Small Communities) implementation. It consists 
in performing phase 1 only once: this implementation ensures 
that the partition separates the graph in small communities 
(typically smaller than 10 nodes). 

The LC (Large Communities) implementation. When 
performing the usual algorithm, a stopping criterion is added: 
the algorithm is stopped (if not already stopped thanks to 
the first criterion) before the size of the largest community 
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Fig. 7. Left: original smooth graph signal (sum of the first five global graph Fourier modes normalized by its maximum absolute value) defined on the 
Minnesota traffic graph. The vertical colorbar of this figure is valid for all graph signals represented on this figure. Top row: signal approximation on 
the graph approximation after each level {1) of the analysis cascade (with the Cosub SC implementation), each supernode being placed at the average 
position of the nodes of its subgraph. The horizontal colorbar on the bottom of each hgure corresponds to the weights of the links of the corresponding 
coarsened graph. Figures who do not have a bottom horizontal colorbar represent binary graphs. Lower row: for each of the successive approximations, we 
represent the upsampled reconstructed graph signal, obtained by setting the details to zero. 


becomes larger than a given threshold r. In fact, iterating both 
phases, communities become gradually larger; and recall that 
our proposal relies on the diagonalisation of the local Lapla- 
cian matrices, which has a cubic computation cost. In order to 
control computation time, we do not allow communities larger 
than the threshold, hereafter r = 1000 nodes. 

For comparison, in Section VI-B2, we will show some 
results obtained with another famous multiscale community 
detection algorithm, called Infomap [42]. With this algorithm 
also, one may define analog SC and LC implementations. 
Note on stochasticity: The Louvain and the Infomap algo¬ 
rithms are stochastic: they do not necessarily output the same 
partition at every run on the same data. This implies that the 
output of the analysis cascade of our filterbank may differ from 
one realisation to another (this is also the case of other methods 
such as the filterbanks based on bipartite graphs). Stochasticity 
is not an issue as synthesis operators are built according to the 
solutions found during the analysis: reconstruction is always 
perfect. For the results in Table I and Figures 10, 11 and 14, 
we show the median computed over 10 realisations. 

B. Choice of adjacency matrix 

When performing community detection, one may choose to 
use only the original adjacency matrix A as it is, or incorporate 
some information about the graph signal x to follow more 
closely its evolution. We explore two choices: 

CoSub, short for Connected Subgraphs, is based on simply 
applying the Louvain algorithm on the adjacency matrix A\ 

EdAwCoSub, short for Edge Aware^ Connected Subgraphs, 
takes the signal x into account for subgraph partitioning and 
modifies the adjacency matrix into 

(a3(i)-a3(j))^ 

= 0 ifAii,j)=0 

where ax = std({|cc(i) — (i ^ j means i neighbor 

to j in Q). This choice of ax is classical in the clustering 
literature [30]. The Louvain algorithm is then applied on Ax. 

^We use the term edge aware in relation to usage in the Signal processing 
community; the reader can think of it as “signal-adapted” if preferred. 


The obtained partition enables us to write A = Aint + Aext 
in both cases. Edge-awareness may also be implemented, as 
in [25], by adapting image segmentation methods to graph sig¬ 
nals; such an advanced comparison between edge-awareness 
methods is left for future work. In Section VI, we compare the 
4 implementations of the proposed filterbank: CoSub SC and 
LC, EdAwCoSub SC and LC; to methods from the literature. 

C. Complexity of the algorithm 

At a given level of the analysis cascade, computing the 
analysis atoms requires: i) to run the partitioning algorithm: 
the Louvain algorithm has a linear complexity 0{N) [41]; 
ii) the diagonalisation of the Laplacian associated to Ai^t, 
i.e. a block diagonal matrix containing as many blocks as 
there are detected communities. Given that the diagonalization 
of a matrix of size N costs 0{N^), the diagonalization of 
a block diagonal matrix containing K blocks of same size 
thus costs 0{N^ Overall, at each level of the cascade, 
computing the analysis atoms costs 0{N-\-N^ Typically, 
if K = N/a with a an average small number of nodes per 
community (see end of Sec. VLB for typical values of a), 
the complexity turns out to be + 1)^)- Cascading 

the analysis on all levels thus costs 0{a^N log N). This is 
to compare to the global graph Fourier analysis that costs 
0{N^). 

VI. Applications 

All the reported examples are computed using a developed 
Matlab toolbox that is available for download^. The compar¬ 
isons with methods from the literature use the implementations 
from the original authors, when they are available. 

A. Two illustrative examples 

1) An example of approximated graph signal: Fig. 7 
shows successive approximated signals {(cci, of 

a smooth signal defined on the Minnesota traffic graph [26], 
using the CoSub SC implementation. Notice how the last 
level’s approximated signal, even if small in size (12 nodes) 
still captures the original signal’s information remarkably well. 
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$( 1 ) 

(315 atoms) 


(315 atoms) 


^( 1 ) 

(35 atoms) 


^( 2 ) 

(62 atoms) 


(62 atoms) 




(25 atoms) 


^( 3 ) 

(13 atoms) 


(13 atoms) 




(2 atoms) 











Fig. 8. Selected analysis atoms of the filterbank applied to a 32 x 32 regular 
2 dimensional grid. Top line of figures: the top middle figure represents the 
graph Fourier transform of the image (top left) seen as a graph signal on the 
2D grid. Other lines of figures: each line represents a choosen set of analysis 
atoms, corresponding to a given channel of the filterbank. For instance, the 
second line represents the atoms i.e., the atoms corresponding to the 

first channel of the first level of the analysis cascade. The black lines on the 
first column’s image represents the partition of the graph in 315 subgraphs, 
which is, in this case, the output of the Edge-Aware SC implementation of 
our filterbank. To each subgraph is associated a local atom, by construction, 
such that we can represent all of them on the same global image. The second 
column’s figure represents all global Fourier transforms of all 315 atoms, 
and, superimposed with a thick black line, is their average. The third column 
represents the reconstruction of the original image if one keeps only the 
information that went through this analysis channel, and discards the rest. 



Fig. 9. Benchmark images. From left to right: cameraman, coins, synthetic. 



Fig. 10. Comparison of compression performance on the three benchmark 
images of Fig. 9 (from left to right: cameraman, coins, synthetic). Dotted 
lines represent edge-aware methods. As such, they can only be compared one 
to another and not to the non-adaptative methods represented as full lines. 


2) A small image: Images can be studied as graph signals 
defined on the two-dimensional regular grid (each pixel is a 
node, and each node has four neighbors), and may therefore 
be analyzed by the proposed graph-based filterbank. Consider 
the small 32 x 32 image of Fig. 8: it is a graph signal defined 
on a regular graph of size 1024. Its graph Fourier transform is 
represented on the same figure. We analyze this image with the 
EdAwCoSub SC algorithm and the rest of Fig. 8 represents a 
selection of atoms of the filterbank, shown both in the node and 
the global graph Fourier domain, and partially reconstructed 
images from the projection of the original image on these 
atoms. For the interested reader, a dedicated PDF file in our 
Toolbox shows all 1024 atoms. Note that the support of the 
subgraphs are clearly impacted by edge-awareness. We see 
that each atom is compactly supported (in the node domain) 
and only (very) approximately localized in the global graph 
Fourier domain. Moreover, we see how, within a given level 
(j), the mean frequency of increases as I increases. The 
cause of the frequency delocalisation is that regular grids are 
not decomposable in a sum of almost disconnected subgraphs: 
the local Fourier modes on which we base our design are 
therefore far from localized in the global Fourier domain. 
In fact, regular grids are a typical structure for which our 
method (and the graph partition in communities) is not very 
appropriate. Nevertheless, we still show results on images for 
pedagogical purposes and in order to compare performance 
with other methods from the literature. 


B. Graph signal reconstruction via non-linear approximation 

One of the use of classical filterbanks is compression. 
The main idea relies on the fact that natural signals are 
approximately smooth at different scales of analysis, and have 

^URL: http://perso.ens-lyon.fr/pierre.borgnat/Codes/CoSubFBtoolbox.zip 
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therefore a sparse representation on filterbanks’ atoms. One 
may thus transform the signal with the filterbanks, keep the 
low-pass coefficients and a fraction of high-pass coefficients 
while setting the others to zero, and still obtain a decent 
reconstruction of the original signal. In the following, we apply 
this non-linear approximation (NLA) scheme on images and 
on the Minnesota traffic graph. 

Typical filterbank comparisons using NLA look at recon¬ 
struction results after three levels of analysis. In our case, as 
we do not know beforehand in how many subgraphs the par¬ 
titioning algorithm will cut the graph, we cannot predict how 
many low-pass coefficients will be left after a given number 
of levels of the analysis cascade. Thus, for a comparison with 
other methods, and for a given compression ratio^, we will 
compute all non-linear approximations corresponding to all 
different levels of the cascade, and keep the level for which 
the reconstruction result is the best. 

1) Reconstruction of images: Consider for instance the 
benchmark image cameraman shown in Fig. 9 (left). Its size 
is 256 X 256 {N = 65536 is the size of the associated graph 
signal). Table I compares the reconstruction details after NLA 
of CoSub LC and SC, EdAwCoSub LC and SC, to the classical 
image filterbank CDF 9/7, the Graph Bior filterbank [26] 
with Zero DC, graphBior(6,6) filters and Gain Compensation 
(GrBior); and the same Graph Bior filterbank but including 
edge-awareness [25] (EdAwGrBior). Also, Fig. 10 recaps the 
PSNR of reconstruction for each of the three benchmark 
images of Fig. 9. We see that our method, without edge- 
awareness, does not perform as well as GrBior. This is due 
to the fact that our method is not best suitable to 2D grids as 
they do not have a natural structure in communities and, on the 
contrary, GrBior is best suitable to bipartite graphs, of which 
2D grids are an example. On the other hand, when adding 
edge-awareness, the graph becomes more structured and we 
obtain results similar to EdAwGrBior. 

Note on the LC implementation. We see here that the best 
reconstruction for the LC implementation is always obtained 
after only one level of analysis. In this case, the filterbanks 
may hardly be seen as a multiscale analysis, but rather as 
a graph windowed Fourier transform, where the window is 
simply an indicator function on each subgraph. Note that this 
window changes from one subgraph to another and it is hence 
different from the proposition of [13], [14] for windowed 
graph Fourier transform. To observe a multiscale analysis 
with the LC implementation, one needs to either decrease the 
threshold r or increase the graph’s size. 

2) Reconstruction performance for graph signals: The 
graph signal model underlying the NLA scheme is that graph 
signals should be smooth with respect to the topology on 
which they are defined. For instance, let us consider the graph 
signal of Fig. 7 (left): it is by construction smooth with respect 
to the underlying graph as it is the sum of the first five 
eigenvectors of its Laplacian matrix. We compare in Fig. 11 
(left) the reconstruction performance for our filterbank imple¬ 
mentations, to Shuman’s Laplacian pyramid filterbanks [27]. 

^the compression ratio is defined as the ratio of the size of the original data 
over the size of the compressed data 
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Fig. 11. Left: compression results of the graph signal of Fig. 7, and (black 
line) a Gaussian random signal of same energy defined on the same graph. 
Right: same comparison using the Infomap clustering algorithm rather than the 
Louvain algorithm to hnd partitions in connected subgraphs (see Section V-B). 
Results with the random signal are averaged over 10 realisations. Dotted lines 
represent edge-aware methods. As such, they can only be compared one to 
another and not to the non-adaptative methods represented as full lines. 


This method was not originally written with edge-awareness, 
but one can simply consider (as in Eq. (31)) instead of A 
to make it edge-aware and define what we call the edge-aware 
Laplacian pyramid method (EdAw Lap. Pyr.). Also, up to our 
knowledge, GrBior filterbanks have only been implemented for 
one-level cascades on arbitrary graphs, which explains why 
we do not consider them here. The full black line on the 
same Figure shows the performances for a random Gaussian 
signal of zero mean and variance 1, normalized to have the 
same energy as the smooth signal (all methods collapse on the 
same black line). As expected, random signals are dense on 
any analysis atoms, and reconstruction is comparatively poor. 
Moreover, we see that our proposed filterbanks really have an 
edge for signals who are smooth compared to the community 
structure of the underlying graph. On the right of Fig. 11 
are represented the performances obtained with the Infomap 
algorithm rather than the Louvain algorithm (see Section V-B). 
In this particular case, performances with Infomap are better. 
Empirically, we find that using the Louvain algorithm or the 
Infomap algorithm yields in general similar results. 

Note on the typical size of communities: In this Minnesota 
example (resp. cameraman example), the typical community 
size of the first level of the cascade is 2 (resp. 2) for CoSub 
SC, 5 (resp. 5) for EdAwCoSub SC, 80 (resp. 200) for CoSub 
LC, and 40 (resp. 100) for EdAwCoSub LC. 


C. Application in denoising, on the Minnesota traffic graph 

Another application of filterbanks is denoising. We consider 
first a piece-wise constant graph signal (that has only two 
possible values: -fl and -1) defined on the Minnesota traffic 
graph, as shown in Fig. 12a; so as to compare our proposition 
with previously published methods. We corrupt this signal with 
an additive Gaussian noise of standard deviation cr. Fig. 12b 
shows such a corrupted signal with cr = 1/4. We then attempt 
to recover the original image by computing the first level of 
the analysis cascade, and reconstructing the signal from all 
low-pass coefficients and thresholded high-pass coefficients 
having absolute value higher than T = 3cr. In order for such a 
thresholding scheme to be justified for denoising, the energy of 
coefficients associated to noise should have a constant variance 
in all the details after analysis. For that, we use here a L 2 
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Compr. ratio 

5 

26 

54 

96 

lev 

#LP 

#HP 

PSNR 

lev 

#LP 

#HP 

PSNR 

lev 

#LP 

#HP 

PSNR 

lev 

#LP 

#HP 

PSNR 

CDF 9/7 

3 

1024 

12083 

37.1 

3 

1024 

1497 

25.7 

4 

256 

958 

22.0 

4 

256 

427 

20.3 

GrBior 

6 

16 

13091 

36.9 

5 

64 

2457 

26.5 

6 

16 

1198 

24.0 

6 

16 

667 

22.4 

CoSub LC 

1 

282 

12825 

34.6 

1 

274 

2247 

25.5 

1 

273 

941 

23.1 

1 

279 

404 

21.5 

CoSub sc 

3 

2017 

11090 

35.3 

4 

282 

2239 

21.5 

4 

281 

933 

19.3 

4 

282 

401 

18.2 

EdAwGrBior 

5 

64 

13043 

39.0 

6 

16 

2505 

29.3 

6 

16 

1198 

27.0 

6 

16 

667 

25.4 

EdAwCoSub LC 

1 

478 

12629 

37.3 

1 

486 

2035 

29.3 

1 

486 

728 

27.4 

1 

486 

197 

25.9 

EdAwCoSub SC 

2 

2584 

10523 

39.1 

3 

430 

2091 

28.7 

3 

422 

792 

26.8 

3 

439 

244 

25.7 


TABLE I 

Comparison of the compression performance with other methods on the benchmark image cameraman shown in Fig. 9. 



Fig. 12. a) Piece-wise constant signal on the Minnesota traffic graph (only 
2 values: ±1), and b) its corrupted version with an additive Gaussian noise 
of std (7 = 1/4. The four other figures are denoised signals after a one- 
level analysis and hard-thresholding all high-pass coefficients with T = 3a. 
c) EdAwGrBior; d) EdAw Lap. Pyr.; e) CoSub LC; f) EdAwCoSub LC. 



Fig. 13. a) Smooth signal on the Minnesota traffic graph (same as in Fig. 7), 
and b) its corrupted version with an additive Gaussian noise of standard 
deviation a = 1/4. The four other figures are denoised signals after a one- 
level analysis and hard-thresholding all high-pass coefficients with T = 3a. 
c) EdAwGrBior; d) EdAw Lap. Pyr.; e) CoSub LC; f) EdAwCoSub LC. 


normalization of the local Fourier modes (rather than Li) for 
this denoising experiment (see the discussion in Section IV-E). 

Fig. 12 compares results obtained with EdAwGrBior and 
EdAw Lap. Pyr. filterbanks to our proposition, for cr = 1/4. 
Eig. 14 (left) summarizes SNR results for different values of a. 
These results may be compared to those obtained by Sakiyama 
et al. and summarized in Table 5 of [20]. We study also the 
denoising on the smooth signal of Eig. 7. Results are shown 
in Eig. 13 for cr = 1/4 and summarized in Eig. 14 (right) for 
different values of cr. 

All four of our implementations outperform GrBior. More¬ 
over, CoSub LC and the Laplacian pyramid obtain similar 
results; our method performing slightly better at high noise 




-i»-Noisy 

GrBior 

-<-Lap pyramid 
-•—Cosub LC 
-e-CoSub SC 
-❖-EdAwGrBior 
-★-EdAwLap pyramid 
-X- EdAwCoSub LC 
-EdAwCoSub SC 


Fig. 14. Comparison of the denoising performance for (left) the piece-wise 
constant signal of Fig. 12a and (right) the smooth signal of Fig. 13a; vs. the 
standard deviation of a Gaussian corrupting noise. Both signals are normalized 
such that their maximum absolute value is one. Results are averaged over 
ten realisations of the corrupting noise. Dotted lines represent edge-aware 
methods. As such, they can only be compared one to another and not to the 
non-adaptative methods represented as full lines. 


level. Edge-awareness helps in the case of the piece-wise 
constant signal and not so much for the smooth signal. 


VII. Conclusion 

While previous methods are based on global filters defined 
in the global Eourier space of the graph, we defined local filters 
based on the local Eourier spaces of each connected subgraph. 
Thanks to this paradigm, a simple form of filterbanks is 
designed, that one could call Haar graph filterbank. 

We first illustrated this for compression on images, mainly 
for pedagogical and state-of-the-art comparison purposes. In 
fact, without edge-awareness, our proposition is not really 
appropriate for such regular structures. Edge-awareness, on the 
other hand, by giving structure to the network raises perfor¬ 
mance to the state-of-the-art. The improvement over existing 
methods becomes truly apparent for irregular graphs for which 
a community structure exists. Existence of communities is a 
very common, if not universal, property of real-world graphs; 
and our filterbanks rely on this particular organization of com¬ 
plex networks. Eor such graphs, our proposition outperforms 
existing ones on non-linear approximation experiments and 
equals state-of-the art on denoising experiments. 

Within this framework, future work will concentrate on 
extending the local filters to more sophisticated filters, and on 
finding ways to critically sample jointly the graph structure 
and the graph signal defined on it. 













































12 


Appendix 

A. Analysis, synthesis and group operators on a toy example 

Consider the trivial graph composed of five nodes shown in 
Fig. 15. Three of them form a closed triangle. The other two 
are connected. The triangle and the pair are connected to each 
other with only one link. Its adjacency matrix A reads: 


A = 



(32) 


Fig. 15. A simple toy graph to illustrate our operators and notations. 


Til = 1 0 ri2 = 1 0 Tig = 1 (38) 


Here, the approximated graph’s adjacency matrix reads: 


- 1 

1 

0 ■ 
0 


- 1 

1 

0 0 

_1 


- 1 - 
1 

1 

0 

_ 0 

0 

1 

1 

^^2 = 

1 

0 

_ 0 

0 

1 

1 

ris = 

1 

0 

_ 0 _ 


In this example, we consider the partition that separates the 
triangle (subgraph from the pair (subgraph 


^ A^xt^i — 


0 1 
1 0 


(39) 


c= (1,1,1,2, 2). (33) 

Therefore T^^^ = (1,2,3) is the list of the nodes composing 
the triangle, and T^^^ = (4,5) is the list of the nodes 
composing the pair. The subsampling operators associated to 
and read: 


Let us add a second level of analysis where = (1,1): 
we group together the two nodes of the approximated graph. 
The second-level approximated graph is thereby reduced to 
one node, and the analysis operators are: 

©f) = [ 1/2 1/2 and ef ^ = [ 1/2 -1/2 . (40) 


- 1 

0 

0 ■ 


■ 0 

0 - 

0 

1 

0 

, = 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 


1 

0 

0 

0 

0 


0 

1 


(34) 


The intra- and inter- subgraph adjacency matrices read: 


- 0 

1 

1 

0 

0 - 


- 0 

0 

0 

0 

0 - 

1 

0 

1 

0 

0 


0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

^ext — 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 


_ 0 

0 

0 

0 

0 _ 


The local Laplacian operators and read: 


C 


= 

int 



(35) 


In the following, we choose the Li normalisation for the 
One may diagonalize and obtain and 


1/3 

1/2 

1/4 

^p(l) = 

1 1 

2/3 

1/3 

-1/2 

1/4 

1 -1 

2/3 

1/3 

0 

-1/2 


1 0 

-4/3 _ 


as well as = diag(0,3,3). One may also diagonalize 
obtain A*^^^ = diag(0,2) as well as and 


Q 


(2) = 

r 1/2 

1/2 1 

p (2) _ 

■ 1 1 ■ 


1 1/2 

-1/2 J 

5 ^ — 

1 -1 


(36) 


Moreover, A^i = 3, A ^2 = 2, therefore there are A^i = 3 
analysis, synthesis and group operators: 



- 

1/3 

0 

- 


r 1/2 

0 - 


- 

1/4 1 



1/3 

0 



-1/2 

0 



1/4 

01 = 


1/3 

0 


©2 = 

= 

0 

0 

©3 = 


-1/2 



0 

1/2 


0 

1/2 



0 


- 

0 

1/2 J 


0 

-1/2 J 


- 

0 


- 1 

0 - 


r 

1 

0 - 


- 2/3 

n 



1 

0 


- 

-1 

0 


2/3 



Hi = 


1 

0 

n2 = 


0 

0 

n3 = 


-4/3 


(37) 


0 

1 



0 

1 


0 




_ 0 

1 



0 

-1 


0 




Therefore, the 4 detail analysis atoms read: 

^ 2 ^^ = = 0^^^ and 

= 0^^^ X 0^^^ = [ 1/6 1/6 1/6 -1/4 -1/4 . 

(41) 

The approximation analysis atom at level (2) reads: 

#(2) ^ 0(1) X 0p)T ^ 1 |- 1 1 I I 1 ]T (42) 

If using a L 2 normalization, ^2 would read: 

= [ 1/V6 1/V6 1/V6 -1/2 -1/2 . ( 43 ) 


B. A proposition for uniqueness of the operators 

To enforce uniqueness of the graph Fourier basis in the 
case of eigenvalue A having multiplicity m > 1, a possible 
rule can be set as follows. Consider the first vector of this 
eigenspace. All we know is its orthogonality with vectors 
of all other eigenspaces, i.e. N — m vectors. We decide to 
arbitrarily force its last m coefficients to zero, and then find 
the unique N — m coefficients that respects orthogonality with 
other known vectors and proper normalization. Note that, if at 
least one of the vectors of the other eigenspaces have non¬ 
zero values only on these last m coefficients, we then look 
for the set of m coefficients closest possible to the last one 
such that uniqueness is guaranteed. For the second vector, it 
has to be orthogonal to the already decided N — m^l vectors: 
we arbitrarily force its m — 1 last coefficients to zero and find 
the unique set of its set of coefficients thanks to orthogonality. 
And so on and so forth up to the multiplicity m. 

Note that, for practical implementations, classical functions 
for eigenvector computation (for instance eig or svd in 
Matlab) empirically output the same choice of eigenvectors 
when run on two exactly identical inputs, even when there are 
eigenvalues with multiplicity. 
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